HeteroCache: A Dynamic Retrieval Approach to Heterogeneous KV Cache Compression for Long-Context LLM Inference
arxiv.org·14h
FlashAttention 4: Faster, Memory-Efficient Attention for LLMs
digitalocean.com·7h
A Novel Side-channel Attack That Utilizes Memory Re-orderings (U. of Washington, Duke, UCSC et al.)
semiengineering.com·37m
Build Your Own Key-Value Storage Engine—Week 6
read.thecoder.cafe·6h
32GB of RAM costs $300 now: How to survive without upgrading
howtogeek.com·1d
From 154 GB to 23 GB: Why modern games are becoming less optimized – and what "Helldivers 2" reveals about it
igorslab.de·1d
How poor chunking increases AI costs and weakens accuracy
blog.logrocket.com·5h